ABSTRACT

The VSOP mission is a Japanese-led project to study radio sources with sub-milliarcsecond
angular resolution using an orbiting 8-m telescope, HALCA, and global arrays of Earth-based
telescopes. Approximately 25% of the observing time has been devoted to a survey of
compact AGN at 5 GHz which are stronger than 1 Jy, the VSOP AGN Survey. This paper,
the second in a series, describes the data calibration, source detection, self-calibration,
imaging and modeling, and gives examples illustrating the problems specific to space VLBI.
The VSOP Survey web-site which contains all results and calibrated data is described.

Subject headings: galaxies: active — radio continuum: galaxies — surveys — 
techniques: interferometric 

1. Introduction 

On 1997 February 12, the Institute of Space and Astronautical Science (ISAS) launched 
the HALCA satellite carrying an 8 m radio telescope dedicated specifically to Very Long 
Baseline Interferometry (VLBI). With an apogee height of 21,400 km, radio sources can
be imaged with angular resolution three times greater than with Earth-based arrays at
the same frequency (Hirabayashi et al. 1998). About 25% of the observing time to date has
been dedicated to the VSOP AGN Survey of ~ 400 flat-spectrum AGN which are stronger
than 1 Jy at 5 GHz. Observations from the VLBA Pre-Launch Survey (hereafter VLBApls,
Fomalont et al. 2000b) revealed that 294 of these sources demonstrated compact structures 
suitable for observations with HALCA, and these were included in the VSOP Source Sample 
(VSS). (This number was initially reported as 289 [Hirabayashi et al. (2000b)] but increased 
to 294 when it was found that the use of low accuracy positions had initially resulted in 
5 other sources not being detected [Edwards et al. (2002)].) The compilation and general 
description of the VSOP AGN Survey is given by Hirabayashi et al. (2000b) (Paper I) and 
Fomalont et al. (2000a). The major goal of the Survey is to determine statistical properties 
of the sub-milliarcsecond structure of the brightest extragalactic radio sources at 5 GHz, and 
to compare these structures with other properties of the sources. Combined with ground 
observations at many radio frequencies (single-dish and VLBI) and at higher energies, the 
Survey will provide an invaluable source list for detailed ground-based studies, as well as a
list of sources for future space VLBI missions.

In this paper, the second in the VSOP Survey series, we describe the data calibration 
and imaging procedures adopted for the VSOP Survey Program. These procedures differ
significantly from conventional VLBI data reduction because of the relatively
poor phase stability and low signal-to-noise inherent in space VLBI. Paper III (Scott et al.
2004) presents results for the first 102 sources and Paper IV (Horiuchi et al. 2004) contains
a statistical analysis using the visibility data for sources with δ > −44°.

In §2 we briefly review the correlation of VSOP data. In §3 we discuss the calibration
procedure, and in §4 we outline the self-calibration, imaging and modeling of the sources. 
Finally, in §5 we describe the VSOP web site and its access. 

2. Correlation of VSOP Data 

The VLBI Space Observatory Programme (VSOP) was described in detail by Hirabayashi 
et al. (2000a,b). For the Survey, the VLBI wavefront data are recorded in the standard 
HALCA continuum mode at each participating ground telescope, and HALCA data are
similarly recorded by one or more of the five tracking stations (see Paper I). The delays in the
downlink from the spacecraft to each tracking station were also monitored at the tracking
stations (Hirabayashi et al. 2000a). Four recording formats have been used in VSOP
observations: VLBA (Napier et al. 1994), MkIV (Whitney 1999), S2 (Carlson et al. 1999) and
VSOP (Shibata et al. 1998). For many Survey observations, a mixture of recording formats
is used at the tracking stations and ground telescopes; in these cases, special-purpose
devices located at the VSOP correlator in Mitaka, Japan are used to translate the data to a
common format, which is then supplied to the appropriate correlator. The majority of the 
Survey experiments were correlated with the S2 Correlator (DRAO, Penticton, BC, Canada) 
until 2002 August. The VSOP correlator (NAOJ, Mitaka, Japan) has been used for some 
of the observations which included the ground telescopes at Usuda and Kashima, and has 
been used for all observations after 2002 August. The VLBA correlator (NRAO, Socorro, 
NM, USA) was used until early 2002 for many of the General Observing Time (GOT) ex- 
periments in VLBA and Mk4 formats, from which a sub-set of the data was extracted for 
use in the Survey (Hirabayashi et al. 2000b). Data are exported from each correlator in a 
format appropriate for initial reduction in the NRAO AIPS package (Greisen 1988). 

The correlator output consists of typically 128 frequency channels in each of two 16 MHz 
bands. The VLBA and Penticton correlators produced data at time-samplings of 0.5 seconds
and 2.0 seconds for space-ground and ground-only baselines respectively, while the Mitaka
correlator produces data at a 1 second sampling on all baselines. Thus the data have sufficient
resolution to search for fringes within a window spanning a residual delay of ±4 μs and a
residual phase-rate of 1 Hz. This corresponds to a position error of 500 m and velocity error 
of 3 cm/s for the HALCA satellite, significantly larger than the nominal errors of the orbit 
determination (Hirabayashi et al. 2000a). 
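
The size of this search window can be related to the sampling of the correlator output. The following is a minimal sketch, assuming the usual Nyquist relations: an unambiguous delay range of ±1/(2Δν) for a channel width Δν, and an unambiguous fringe-rate range of ±1/(2Δt) for an integration time Δt. The function names are illustrative and do not belong to any correlator or AIPS software.

```python
def delay_window_us(bandwidth_mhz: float, n_channels: int) -> float:
    """Half-width of the unambiguous delay window, in microseconds.

    Assumes the Nyquist relation +/- 1 / (2 * channel width).
    """
    channel_width_mhz = bandwidth_mhz / n_channels
    return 1.0 / (2.0 * channel_width_mhz)


def rate_window_hz(integration_s: float) -> float:
    """Half-width of the unambiguous fringe-rate window, in Hz.

    Assumes the Nyquist relation +/- 1 / (2 * integration time).
    """
    return 1.0 / (2.0 * integration_s)


# Values taken from the text: 128 channels per 16 MHz band,
# and 0.5 s, 1 s or 2 s time sampling depending on correlator and baseline.
print("delay window: +/-", delay_window_us(16.0, 128), "microseconds")
for label, dt in [("0.5 s sampling", 0.5), ("1 s sampling", 1.0), ("2 s sampling", 2.0)]:
    print(label, "-> rate window: +/-", rate_window_hz(dt), "Hz")
```

Under these assumptions, 128 channels across 16 MHz give ±4 μs and 0.5 s sampling gives ±1 Hz, consistent with the window quoted above for space-ground baselines.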

The translation integrity and the relative amplitude scaling of the three correlators 
were checked using the results of several three-hour experiments in which a strong source 
was observed using three ground stations, two of which could record data simultaneously
in two formats. The data were processed through all three correlators (with format
translations made as necessary), and the results were compared. First, it was found that the
translation process did not change the correlated amplitude by more than 2%, except when 
there were clear indications of recording problems. Second, the comparison of the visibility 
amplitudes for the same experiment processed by each correlator established the relative 
correlator amplitude scale factor to an accuracy of 3% (G. A. Moellenbrock et al. 2002, 
private communication).

3. Data Calibration and Detection 

The reduction of VSOP Survey observations is being undertaken by a global effort 
of astronomers with an interest in high-resolution imaging. Therefore, a set of reduction 
procedures has been developed to ensure, as much as possible, that the Survey results are 
internally consistent. 

The reduction procedure consists of two main parts. The first part, covered in this 
section, consists of initial calibration and fringe-fitting, and is performed using the NRAO 
AIPS package. The second part, covered in §4, consists of self-calibration, imaging and
model-fitting, and is performed using the Caltech Difmap package (Shepherd 1997). The following
sections describe these steps in detail with AIPS tasks and Difmap commands indicated by 
text in the SMALL CAPS style. 

3.1. Preliminaries and a priori Calibrations

The correlator distribution data for each experiment are imported into NRAO AIPS
using the task FITLD. The datasets are sorted, indexed, and documented using standard
AIPS tasks (MSORT, INDXR, LISTR, PRTAN, DTSUM). Except for datasets correlated in
Penticton, it is necessary to run ACCOR to remove fringe normalization errors arising from 
potentially non-optimal sample populations among the four 2-bit voltage levels recorded at 
each telescope. 

For a priori amplitude calibration, system temperature and gain information supplied
by each telescope are imported into the AIPS database using ANTAB. Then, APCAL is used
to form the √SEFD calibration factors required to scale each antenna's gain¹. For HALCA,
the nominal 5 GHz system temperature is ~ 90 K and stable within an observation to ~ 5%.
Its SEFD was monitored early in the mission using total power observations of Cas A, Cyg A
or Tau A and found to be relatively constant. The 5 GHz gain is 0.0062 K/Jy and this yields
a HALCA SEFD of ~ 14,500 Jy, which is more than an order of magnitude larger than most
ground telescopes. The a priori amplitude calibration values and their reliability vary
considerably among the ground telescopes, and can be in error by 30% for telescopes which
are only occasionally used for VLBI.

¹The System Equivalent Flux Density (SEFD) is the ratio of the system temperature (K) and telescope
gain (K/Jy). It concisely describes the sensitivity of a radio telescope, and the geometric mean of the
SEFDs of two telescopes provides the proper scaling factor to convert normalized correlation coefficients
to Janskys.

As VSOP observations are made with global arrays of ground telescopes, it is not
uncommon for some telescopes to be observing at frequencies in the 5 GHz band offset from
their standard frequencies, i.e., the frequencies at which the nominal gain is measured
and monitored. HALCA's 5 GHz system noise temperature varies by almost 15% across
the 4700-5000 MHz band (Kobayashi et al. 2000) and so Survey observations are generally
scheduled at the frequencies where the performance of HALCA, the least sensitive telescope,
was best. Use of nominal gain and nominal system temperature values, or even measured
system temperature values if these were measured at the standard frequencies rather than the
actual observing frequencies, also contributed to the overall uncertainties in gain calibration
of VSOP Survey observations. Further amplitude corrections are discussed in the next
sections, and Paper III describes a more accurate post-facto determination of the amplitude
calibration of the Survey sources as a whole.

3.2. Fringe-fitting

Fringe-fitting, the process by which the correlated signals are detected, is the most
important part of the Survey reductions. Unlike most ground-only VLBI, fringe detection
for HALCA observations is difficult due to the limited sensitivity of the orbiting telescope,
the generally lower correlated flux densities on long baselines and the larger uncertainty in the
spacecraft's location and clock. (The "spacecraft clock" is the hydrogen maser at the tracking
station in use at the time; however, the corrections required to account for the downlink of the
data introduced uncertainties in addition to those encountered for ground radio telescopes
[Hirabayashi et al. 2000a].) This combination of conditions requires fringe searches for weak
signals over large ranges of delay and fringe rate, hence the need for high time and frequency
sampling. The small number of ground telescopes typical of Survey observations limits the
sensitivity for global fringe-fitting as well (Cotton 1995). It is therefore important to limit
the range of the search in delay and fringe rate as much as possible in order to keep the
fringe searching efficient, and to avoid false detections. For many Survey datasets, delay and 
fringe rate solutions are available from fringe searches performed for data quality analysis 
at the correlators. Application of these delay and phase-rate offsets using CLCOR before 
more detailed fringe searching allows for significant data averaging and smaller fringe-search 
windows. For datasets with strong fringes for only a portion of the observation, the resulting 
narrower search should, in principle, allow detection of the weaker fringes. In practice, 
however, these gains have been modest. 
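
The sensitivity limits behind this discussion can be put in numbers using the HALCA values quoted in §3.1 (system temperature ~ 90 K, gain 0.0062 K/Jy) and the 16 MHz band. The sketch below uses the standard single-baseline thermal noise expression, σ = √(SEFD₁·SEFD₂) / (η·√(2·Δν·τ)). The ground-telescope SEFD of 300 Jy and the quantization efficiency η = 0.88 are assumptions chosen for illustration, not values taken from the Survey.

```python
import math


def sefd_jy(tsys_k: float, gain_k_per_jy: float) -> float:
    """System Equivalent Flux Density: system temperature divided by gain."""
    return tsys_k / gain_k_per_jy


def baseline_noise_jy(sefd1_jy: float, sefd2_jy: float, bandwidth_hz: float,
                      integration_s: float, efficiency: float = 0.88) -> float:
    """Thermal noise on one baseline, one band, one polarization (Jy).

    efficiency is an assumed quantization efficiency (illustrative value).
    """
    geometric_mean = math.sqrt(sefd1_jy * sefd2_jy)
    return geometric_mean / (efficiency * math.sqrt(2.0 * bandwidth_hz * integration_s))


HALCA_SEFD_JY = sefd_jy(90.0, 0.0062)   # values quoted in the text
GROUND_SEFD_JY = 300.0                  # hypothetical ground telescope
BANDWIDTH_HZ = 16.0e6                   # one 16 MHz band

print(f"HALCA SEFD: {HALCA_SEFD_JY:.0f} Jy")
for minutes in (2, 10):
    sigma = baseline_noise_jy(HALCA_SEFD_JY, GROUND_SEFD_JY, BANDWIDTH_HZ, 60.0 * minutes)
    print(f"{minutes:2d} min integration: sigma = {1000.0 * sigma:.1f} mJy")
```

Because the noise scales as 1/√τ, lengthening the coherent integration from 2 to 10 minutes lowers the noise by a factor of √5, which is why long integrations help when fringes are weak.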

For most observations, the AIPS task FRING is used for fringe-fitting. Solution intervals 
of up to 10 minutes (approaching the coherence time) are attempted to maximize the
Signal-to-Noise Ratio (SNR). For strong sources, solution intervals as short as 2 minutes can be used
as long as the SNR is greater than about 5. Detections are best gauged by consistency in the 
delay and fringe rate solutions between the two independent frequency channels (see Figure 
1). For the weakest sources (few or no detections in the correlator's data quality search), 
the AIPS task KRING was used since it allows larger searches and longer integrations than 
FRING for the same computer resources. Most of the editing of the data was obvious from 
the loss of detection during the fringing process, and from the telescope logs. 
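
The consistency criterion between the two bands can be expressed as a simple flagging rule. This is only a sketch: the input lists stand for per-interval fringe-fit solutions in each band, and both tolerances are assumptions (3 mHz is borrowed from the fringe-rate resolution quoted in the caption of Figure 1, and 62.5 ns is the reciprocal of the 16 MHz bandwidth). It does not reproduce the behavior of FRING or KRING.

```python
def consistent_detections(rates_band1, rates_band2, delays_band1, delays_band2,
                          rate_tol_millihz=3.0, delay_tol_ns=62.5):
    """Flag solution intervals whose two bands agree in rate and delay.

    Rates are in mHz and delays in ns. Returns one boolean per interval;
    True marks a likely detection under the assumed tolerances.
    """
    flags = []
    for r1, r2, d1, d2 in zip(rates_band1, rates_band2, delays_band1, delays_band2):
        rate_ok = abs(r1 - r2) <= rate_tol_millihz
        delay_ok = abs(d1 - d2) <= delay_tol_ns
        flags.append(rate_ok and delay_ok)
    return flags


# Invented solutions for three intervals: the first two agree between bands,
# the third scatters randomly, as expected once fringes are lost.
rates_1 = [1.0, 1.2, 40.0]
rates_2 = [1.5, 0.9, -120.0]
delays_1 = [10.0, 12.0, 300.0]
delays_2 = [15.0, 8.0, -900.0]
print(consistent_detections(rates_1, rates_2, delays_1, delays_2))
```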

After an adequate fringe-fitting solution is obtained, the combined calibration is applied
using SPLIT or SPLAT, which also averages the corrected data in frequency within each
16 MHz band, and to a common 2-second integration time. The data are then stored using
FITTP or FITAB for subsequent processing.

4. Determining the Source Structure 

In their analysis of VSOP observations of a complete sample of northern sources with 
very good (u,v) coverage on ground-only baselines, Lister et al. (2001) found that the
dynamic range of a VSOP image is limited by poor sampling of the (u,v) plane on
ground-space baselines. They found that the true dynamic range is between 30:1 and 100:1 depending
on source complexity. In the case of VSOP Survey data this problem is amplified by the
smaller number of ground radio telescopes and great care must be taken in interpreting the
data. Although every effort was made to include long ground baselines in the scheduled
array, this was not always achievable in practice. In general, VSOP Survey data provide an
overall picture of the structure, such as core sizes and intensities, and basic jet properties
such as position angle and location with respect to the core.

It is therefore important for the data analyst to keep these limitations in mind when
working with these sparsely sampled (u,v)-data with low SNR. Each stage of the data
reduction must be checked in order to obtain a source structure which is consistent with the
limitations in the data as well as incorporating a priori information about the source
structures, either from the ground-only baselines or from other ground-based VLBI observations
of the source. The Difmap software package was chosen for this part of the processing since 
it provides a good interface for viewing data, as well as visibility-plane model-fitting and 
image deconvolution facilities. 

4.1. First-Pass Editing and Checking 

The data from the AIPS calibration (2 s sampling in two single-channel frequency bands)
are read into Difmap and averaged to a 30 s grid. The weights are calculated as the reciprocal
of the data variance, i.e. the inverse square of the RMS. The data are
then phase self-calibrated with a point source model on a 30 s timescale to determine the
telescope-based phase as a function of time. Further data editing is based on several criteria:
(1) Obvious outliers in a plot of amplitude versus projected (u,v)-distance are removed using
RADPLOT; (2) Periods of low visibility amplitude for any antenna are found using VPLOT;
(3) Periods of very poor phase stability (indicating that the source was not detected during
this period) can be seen using CORPLOT.
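
The averaging and weighting step can be illustrated with a short sketch. It assumes that the variance of each 30 s average is estimated from the scatter of the 2 s samples within the bin, which is one common choice and not necessarily what Difmap does internally. The function name and the zero weight assigned to single-sample bins are choices made for this example.

```python
from collections import defaultdict


def average_visibilities(times_s, visibilities, bin_s=30.0):
    """Average complex visibilities on a regular time grid.

    Returns a list of (bin_centre_s, mean_visibility, weight), where the
    weight is the reciprocal of the estimated variance of the mean.
    """
    bins = defaultdict(list)
    for t, v in zip(times_s, visibilities):
        bins[int(t // bin_s)].append(v)

    averaged = []
    for index in sorted(bins):
        samples = bins[index]
        n = len(samples)
        mean = sum(samples) / n
        if n > 1:
            # Per-component (real or imaginary) sample variance from the scatter.
            sample_var = sum(abs(s - mean) ** 2 for s in samples) / (2.0 * (n - 1))
            mean_var = sample_var / n
            weight = 1.0 / mean_var if mean_var > 0.0 else 0.0
        else:
            # No scatter estimate is possible from a single sample.
            weight = 0.0
        averaged.append(((index + 0.5) * bin_s, mean, weight))
    return averaged


# Invented data: one minute of 2 s samples scattering around 1 Jy.
times = [2.0 * i for i in range(30)]
vis = [complex(1.1, 0.0) if i % 2 == 0 else complex(0.9, 0.0) for i in range(30)]
for centre, mean, weight in average_visibilities(times, vis):
    print(f"t = {centre:5.1f} s  mean = {abs(mean):.3f} Jy  weight = {weight:.1f}")
```

Doubling the scatter of the input samples reduces the weight of each average by a factor of four, as expected for a weight equal to the inverse square of the RMS.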

Although the a priori gain calibration for each telescope is made in AIPS using the
nominal gain and system temperatures, large residual gain errors of up to 30% often persist.
For this reason, observations of additional compact sources by the ground telescopes are
scheduled (typically during gaps in HALCA tracking) and used to better constrain the gain
values for each telescope. These calibrators have known structures from the VLBApls catalog
(Fomalont et al. 2000b) and their flux densities are monitored by observing programs at the
University of Michigan² and at the Australia Telescope Compact Array (Tingay et al. 2003).

²http://www.astro.lsa.umich.edu/obs/radiotel/umrao.html

4.2. Self-Calibration and Imaging Iterations

Since most Survey experiments have limited (u,v)-coverage, imaging and/or model-fitting
requires the introduction of constraints on the size and complexity of the radio emission
in order to obtain accurate deconvolution and self-calibration. The first step is to make a
relatively low resolution image without the HALCA data. These images provide the best
sensitivity to extended structure and help identify regions in the field where the most compact
structure was likely to be located. The VLBApls catalog image, made from observations in 
1996 (Fomalont et al. 2000b), as well as other pre-existing images of the sources (including 
ground observations at 15 GHz which had similar resolution to the VSOP Survey observations
[Kellermann et al. 1998; Gurvits et al. 2004, in preparation]) are also useful in determining 
the constraints needed to image the VSOP Survey data. For about 5% of the VSOP Survey 
observations, it is clear that the visibility amplitude on the shortest projected baselines is 
much lower than that known from pre-existing VLBI observations, even considering possible 
variability of sources. This indicates that a problem has occurred at one or more antennas 
during the observation or during the tape copying process (if it was required) or during 
correlation. If the problem cannot be rectified the observations are considered corrupted,
and placed back into the VSOP Survey observing schedule for another observation. Even in
such cases the data are processed, often as a ground-only observation, as they may still
provide useful information.

The next imaging step includes all of the data to obtain an approximate image. For
most sources, the (u,v)-coverage is sufficient to use CLEAN, followed by a phase-only
self-calibration to improve the phase calibration. Several phase self-calibration iterations are
generally made for each source. For sources with extremely poor (u,v)-coverage,
model-fitting the data with one or two Gaussian-shaped components is used instead of the CLEAN
deconvolution. In some cases a hybrid approach, using CLEAN components for the
extended emission and models for the small-diameter components, is used. The use of various
data weightings to emphasize or de-emphasize the longer VSOP baselines depends upon
the strength and size of the source and the number of visibilities on ground-only baselines
compared to ground-space baselines. To obtain images that best reveal the ~ 0.1 mas scale,
a weighting scheme is used for which the space-ground baselines contribute about 50% of
the effective data. As an illustration of the importance of increasing the data weights on
space baselines, we present the results of fitting a simple model to the VSOP Survey
observations of 3C345 on 1998 July 28 (Figure 2). The MODELFIT procedure in Difmap applies
a weight of 1/σ² to each visibility point. When the weights are calculated in this way the
sensitive ground-only visibilities dominate the fit and the ground-space baselines have little
influence. However, if the HALCA data are upweighted, in this case by a factor of 25 so
that ground-space and ground-only visibilities have roughly equal weighting, the fit improves
significantly.
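
The effect of upweighting can be quantified by the fraction of the total weight carried by the space baselines. The sketch below computes that fraction and the scale factor needed to reach a target share. The visibility counts and per-point weights are invented for illustration, chosen so that the required factor comes out at 25; they are not the 3C345 data.

```python
def space_weight_fraction(ground_weights, space_weights, space_scale=1.0):
    """Fraction of the total fit weight carried by ground-space visibilities."""
    ground_total = sum(ground_weights)
    space_total = space_scale * sum(space_weights)
    return space_total / (ground_total + space_total)


def scale_for_fraction(ground_weights, space_weights, target=0.5):
    """Factor by which space-baseline weights must be multiplied so that
    they carry the requested fraction of the total weight."""
    ground_total = sum(ground_weights)
    space_total = sum(space_weights)
    return target * ground_total / ((1.0 - target) * space_total)


# Hypothetical dataset: many sensitive ground-only points (weight 1/sigma^2 = 100)
# and fewer, noisier ground-space points (weight 1/sigma^2 = 16).
ground = [100.0] * 500
space = [16.0] * 125

before = space_weight_fraction(ground, space)
factor = scale_for_fraction(ground, space, target=0.5)
after = space_weight_fraction(ground, space, space_scale=factor)
print(f"space share before: {before:.3f}, factor: {factor:.1f}, share after: {after:.2f}")
```

With pure 1/σ² weights the space baselines in this invented example carry under 4% of the total weight, which shows how a fit can be dominated by the ground-only data until the weights are rescaled.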

For most Survey experiments, amplitude self-calibration is not used due to lack of closure
constraints and/or limited sensitivity of the space baselines. In the cases where the data
from four or more telescopes are sufficiently strong, amplitude self-calibration using GSCALE
provides a scale factor for each telescope over the entire observation. In some cases, amplitude
self-calibration over a time-scale of one hour is possible.
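
The requirement for four or more telescopes follows from the standard count of independent closure quantities for an array of N telescopes: N(N−1)/2 baselines, (N−1)(N−2)/2 closure phases and N(N−3)/2 closure amplitudes. A minimal sketch of this counting (the function name is illustrative):

```python
def closure_counts(n_telescopes: int):
    """Return (baselines, independent closure phases, independent closure amplitudes)."""
    n = n_telescopes
    baselines = n * (n - 1) // 2
    closure_phases = max(0, (n - 1) * (n - 2) // 2)
    closure_amplitudes = max(0, n * (n - 3) // 2)
    return baselines, closure_phases, closure_amplitudes


for n in (3, 4, 5, 6):
    b, cp, ca = closure_counts(n)
    print(f"N = {n}: {b} baselines, {cp} closure phases, {ca} closure amplitudes")
```

With three telescopes there is one closure phase but no closure amplitude, so the telescope gains cannot be separated from the source amplitudes; the first closure amplitudes appear at N = 4.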

4.3. Image Representation 

A satisfactory image is generally obtained after three or four phase self-calibration loops
and perhaps one amplitude self-calibration. Such quick convergence is due to the limitations
on the achievable image fidelity imposed by the sparse (u,v) coverage, and the lack of
short spacings in particular. The latter typically contain most of the information on
complex, extended structures. For most sources two representations of the source structure are
available: the CLEAN image, and a model-fit image. For small-diameter components with
poor (u,v)-coverage, the model representation is more reliable than the CLEANed image.
In some cases, the CLEAN components more accurately reproduce the extended emission,
while the Gaussian model component describes the small-diameter components more
accurately. Both representations of the source structure should be in relatively good agreement,
and satisfactorily fit the observed (u,v)-data.

The model representation of the source structure isolates important parameters of the 
components which are necessary to determine angular sizes and brightness temperatures. 
Even for images in which the CLEAN algorithm was used to determine the source structure, 
the models are chosen carefully, starting simple and moving to more complicated structures. 
The goal is to fit the observed visibilities within the uncertainties using the smallest number of 
model parameters, and to duplicate the structure found by cleaning. The following guidelines 
are used in choosing model components: 

1. The number of components is kept to a minimum. 

2. Simple components are preferred to complex ones, i.e. a point model is better than a 
circular Gaussian model which is better than an elliptical Gaussian component. 

3. If an elliptical component becomes linear during model fitting it is generally an
indication that the sampling of the (u,v)-plane is poorly constrained in the direction
perpendicular to the component's major axis. In these cases a circular Gaussian
component is favored.

4. In general, additional or more complex components are used only if the data or the 
image require it. 
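
The quantities behind these guidelines can be sketched with two standard expressions. The visibility amplitude of a circular Gaussian of flux density S and FWHM θ at (u,v) radius q is S·exp(−(πθq)²/(4 ln 2)), and the peak brightness temperature of an elliptical Gaussian in the observer's frame is T_b = (2 ln 2 / πk)·Sλ²/(θ_maj·θ_min). The function names and example values below are illustrative and are not Survey results; no redshift correction is applied.

```python
import math

MAS_TO_RAD = math.pi / (180.0 * 3600.0 * 1000.0)  # milliarcseconds to radians
BOLTZMANN = 1.380649e-23                           # J/K
LIGHT_SPEED = 299792458.0                          # m/s
JANSKY = 1.0e-26                                   # W m^-2 Hz^-1


def gaussian_visibility(flux_jy, fwhm_mas, uv_radius_mlambda):
    """Visibility amplitude (Jy) of a circular Gaussian component.

    A point model is the limiting case fwhm_mas = 0.
    """
    theta = fwhm_mas * MAS_TO_RAD
    q = uv_radius_mlambda * 1.0e6
    return flux_jy * math.exp(-(math.pi * theta * q) ** 2 / (4.0 * math.log(2.0)))


def brightness_temperature_k(flux_jy, major_mas, minor_mas, freq_ghz):
    """Observer-frame peak brightness temperature (K) of a Gaussian component."""
    wavelength = LIGHT_SPEED / (freq_ghz * 1.0e9)
    area = (major_mas * MAS_TO_RAD) * (minor_mas * MAS_TO_RAD)
    return (2.0 * math.log(2.0) / (math.pi * BOLTZMANN)) * flux_jy * JANSKY * wavelength ** 2 / area


# A hypothetical 1 Jy component of 0.3 mas FWHM observed at 5 GHz.
for q in (0.0, 100.0, 300.0, 500.0):
    print(f"q = {q:5.0f} Mlambda: |V| = {gaussian_visibility(1.0, 0.3, q):.3f} Jy")
print(f"T_b = {brightness_temperature_k(1.0, 0.3, 0.3, 5.0):.2e} K")
```

The comparison between a point model and a small Gaussian only becomes measurable on the longest baselines, which is one reason the simplest component that fits the data is preferred.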

The calibrated data, models and cleaned image are then saved in the NRAO AIPS
UVFITS format using Difmap's SAVE command. These data can be read back into Difmap
for further processing, imaging and modeling, and are available through the VSOP Survey
Data Base Web site.



5. VSOP Survey Data Base 

Once a data analyst has completed the reduction of an experiment, the calibrated data,
reduction notes and supporting files (from both AIPS and Difmap) are uploaded to a
computer at ISAS. Information from the uploaded files is entered into a database and published
on the VSOP Survey web page (http://www.vsop.isas.jaxa.jp/survey). Displays of the
final calibrated visibilities, images (CLEAN and model-fit), and the model fit parameters are
available. Documentation of the data processing, from the initial calibration, to the
fringe-fitting, to the imaging and modeling, is given for each experiment. The calibrated data
are available from the web site and the original post-correlation data can be obtained upon
request, but this dataset is much larger (of the order of 100 Mb compared to ~ 100 kb).

6. Conclusion 

This paper, the second in the VSOP 5 GHz AGN Survey series, has described the 
data processing used to construct the images and determine models from the VSOP Survey 
Program. Because of the uniqueness of the Space VLBI Survey data, we have described 
many of the procedures in some detail since they are different from normal VLBI reduction 
practices. Enhancement of the data procedures and the development of new algorithms 
(especially for detecting weak sources) are needed for further space VLBI missions. 
and A.R.T. wish to acknowledge support from the Canadian Space Agency.

[Figure 1: plot of IF rate difference vs UTC time]

Fig. 1. An example of the difference in fringe rate solutions between the two bands for a
typical Survey experiment. Fringes to HALCA are detected until nearly 1/01 at which time
the differences become randomly distributed on a scale exceeding the fringe rate resolution
(~3 mHz FWHM). In this case the fringe rate search window was restricted, which is why
the rate differences are constrained after fringes are lost.



[Figure 2: two panels, each titled "3C345 at 4.816 GHz in LL 1998 Jul 28"; horizontal axis: UV radius (10^6 λ)]

Fig. 2. Model fits to the visibility data from a VSOP Survey observation of 3C345.
Visibility amplitudes (+ symbols) and model visibilities (solid points) are plotted as a function
of (u,v) radius. Left: the result of a fit using the standard 1/σ² weighting; right: a
model fit with the weight on HALCA data increased by a factor of 25.



